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Motivation: 

Complexity lower bounds and independence results are easier in a relativized setting. It 
seems reasonable to define a relativized setting for the propositional calculus in order to 
prove lower bounds. 

CN '. Syntax: 

Q , The language of PC(R) (propositional calculus relativized to R) consists of formulas built 

CN [ from atoms p, q, r, ... using the usual connectives 0,1, A, V, -i, together with the relation symbol 

H I R. The usual formation rules for formulas apply, but in addition we agree that 

ii Ai, ...,An are formulas, n > 0, then R{Ai, ..., An) is a formula 



X 



Semantics: 

A structure r consists of an assignment of a truth value p'^ in {0, 1} (where 1 = TRUE and 
= FALSE) to each atom p, together with a set K^ C {0, 1}* of binary strings. Then each 



U 
U 

Q ■ formula A of PC(R)gets a truth value A'^ G {0, 1} in the obvious way. In particular 

R{A,,...,Any = 1 ^^ AI...AI e R^ 

> 

00 

\Q • This syntax and semantics is essentially the same as that defined by Ben-David and Gringauze 

. [3]. (See also [1].) 
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We say that A is valid iS A'^ = 1 for all structures r, and A is satisfiable iS A'^ = 1 for some 
structure r. 

For example, 

iRip)ARi^p))DiRiq)WRi^q)) 

is valid. In general, A is valid iff -lA is unsatisfiable. 

Theorem 1: The satisfiability problem for PC(R)formulas is in NP (and hence NP- 
complete) . 

Proof: A certificate for satisfiability need only specify r for each atom in A, and for each 
occurrence of the form R{Bi, ..., i?„) in A, some string f i...f„ G {0, 1}" is specified to either 
be in R'^ or not in i?^. D 

System PK(R): 

This is Gentzen's sequent system PK for the propositional calculus (see for example [4] or 
[5]), except formulas are allowed to be PC(R)formulas, and in addition to the axiom scheme 
A ^ A, and the axioms — )■ 1 and — )■, we add the axiom scheme 



^This is a slight revision of a working paper from June 4, 2003. Much of this material was presented at 
the complexity theory workshop at Overwolfach, 30 April, 2003. 



AX: -^AW B,Ay^B,R{C,A,D) -^R{C,B,D) 

which asserts that if A and B are equivalent, then one can be substituted for the other as 
an argument of R. 

Using AX, each of the following four schemes E1,E2,E3,E4 has a PK(R) proof with a 
constant number of sequents: 

El) A, R{C, A, D) -^ R{C, 1, D) 
E2) A, R{C, 1, D) -^ R{C, A, D) 
E3) R{C, A, D) -^ A, R{C, 0, D) 
E4) R{C, 0, D) -> A, R{C, A, D) 

Theorem 2: PK(R) is sound and complete. Further every valid sequent S has a PK(R) 
proof n with 0(2l'^l) sequents, where each sequent in tt has length Od^l), where l-SI is the 
total number of symbols in S. 

Remark: In counting the number of sequents in a proof, we do not count weakenings and 
exchanges. 

Proof: Soundness asserts that every sequent derivable in PK(R) is valid. This is true 
because the axioms are valid and the rules preserve validity. 

Completeness asserts that every valid sequent has a PK(R) proof. To get an upper bound 
on the number of lines in the proof, we make the following definition: 

Definition 1: The cost c{A) of a formula A is the number of occurrences of A, V, -> in A plus, 
for each subformula R{Bi, ...,Bn) in A, the number of formulas in the sequence Bi, ...,Bn 
other than or 1. The cost c{S) of a sequent S is the sum of the costs of the formulas in 
the sequent. 

For example, the cost of R{p Aq,p Aq,p, 0, 1, 1) is 5: 2 for the two occurrences of A, and 3 
for the three nontrivial arguments of R. 

Note that c{A) < \A\, where \A\ is the number of symbols in A, counting commas. 

Lemma 1: For some constant d, each valid sequent S has a PK(R) proof with at most 
(i2'^(s) lines, where each line has length OdS*!). 

Proof: Induction on c{S). 

Basis: Suppose that c(r — )■ A) = and F — > A is valid. Then any occurrence of R must be 
as a formula of the form R{vi, ..., f„) in one of the sequences F or A, where each Vi is either 
or 1. It is easy to check that either 1 is a formula in A, or is a formula in F, or some 
formula occurs in both F and A. In each case, F — )> A can be derived from an axiom (other 
than AX)) by weakenings and exchanges. 

Induction Step: c(F — )■ A) > 0. Then some formula in either F or A must have a 
principal connective that is either A, V, -> or R. For the cases A, V, -> we derive F — )> A by 



the appropriate PK introduction rule (left or right), thus reducing the problem to deriving 
one or two valid sequents, each of reduced cost, so the Induction Hypothesis applies. 

Now suppose that F — )■ A has the form 

r ^A',R{C,A,D) (1) 

where A is not or 1. Then we use the derivation below, based on E2 and E4 above, where 
we have omitted weakenings and exchanges. All indicated inferences use the cut rule. 

AT'^ A',i?((7,l,z3) E2 r ^A',A,R{C,0,D) EA 
A, r' ^ A', R{C, A, D) r -> A', A, R{C, A, D) 



V ^ A',R{C,A,D) 

This reduces the proof of ([1]) to the proof of two valid sequents, each of which has cost one 
less than the cost of ([H). The induction hyposthesis applied to these two sequents gives us 
the desired result. 

The remaining case to consider is that F — !■ A has the form 

r',R{C,A,D) ^ A' 

where again A is not or 1. This time we use the derivation below, using El and E3: 

A,R{C,l,D),r' ^A' El R{C,0,D),T' ^A',A E3 

A,R{C,A,D),r' -^A' R{C,A,D),r' ^A',A 

r,R{C,A,D) ^ A' 

Now we apply the induction hypothesis, as in the previous case. 

Quantified Relativized Prepositional Calculus 

Formulas in QPC(R) are like those in PC(R), except we now allow quantifiers Vx and 3x, 
for an atom x. The semantics are obtained in the obvious way by letting x range over {0, 1}. 

Notation: n^(-R) is the class of formulas of QPC(R) of the form 

VxA(x, p, R) 
where A is quantifier-free. 

Theorem 3: The satisfiability problem for QPC(R) is complete for NEXP. The same is 
true for the satisfiability problem restricted to n^(i?) formulas. 

Proof: It is easy to see that the satisfiability problem is in NEXP: Given a formula A of 
QPC(R), let n be the largest number of arguments of any occurrence if R in A. Guess at 



a structure r for A by writing down truth values to the free variables of A, and specifying 
K^ for R up to strings of length n by writing down a subset of {0, 1}-". Now verify that r 
satisfies A. 

Hardness can be established either by a direct reduction of Turing machine computations 
to QPC(R) satisfiability (proof due to Charles Rackoff), or by using the proof that succint 
circuit satisfiability is NEXP complete (see page 494 of Christos Papadimitriou's textbook 
on Computational Complexity) (proof due to Tsuyoshi Morioka). 

Notation: \A\ denotes the length of a formula A; that is, the total number of occurrences 
of symbols in A. 

Note that if there are many different variables occurring in A then the binary length of A 
could be as more like | A \ log \A\. 

Lemma lA: (with Rackoff) For every nondeterministic TM M there is a polytime transfor- 
mation Fm such that for all x G {0, 1}*, Fm{x) is a n^(-R) formula, and \Fm{x)\ = 0{\x\), 
and 

A = Fm{x) is satisfiable <^=^ M accepts x in at most 2^^^ steps 

Proof Outline: The proof is like that of the Cook-Levin Theorem. Let Co, C2, ..., Ct be a 
computation of T = 2" steps of M on input x, where n = \x\. Here Ci is a bit string of length 
0(2") coding the configuration of M at step i. Thus the computation can be represented by 
a relation R C {0, l}*^*^"), where R{p, q) represents bit p of C^. 

Then Fm{x) is the prenex form oi S A I A E where 

S asserts that the computation starts right 

/ asserts that the computation increments right 

E asserts that the computation ends right 

The formula E is easy, since it merely asserts that the configuration Ct is in an accepting 
state. 

The formula S asserts that the inital configuration, coded by R{p, 0) (as p ranges over all 
possible values), represents a tape configuration consisting of x followed by blanks, and the 
initial state. 

To see how to express this with a formula of length 0{n) we assume for simplicity that 
X = Xi...Xn is a bit string over {0, 1}. We show how to construct a formula Si{p) of length 
0{n) which asserts that for i = 1, ...,n if p represents i in binary then {R{p,0) ^ Xi). This 
explains the interesting part of the construction of S. 

To see how to construct 5*1, let k = \\og2{n + 1)] and suppose pi,...,pk represent the k 
low-order bits when p represents a binary number i. (When i < n, then the reamaining bits 
Pk+i, ■■■,Pcn are 0.) Consider a Boolean circuit a with inputs pi, ...,pk and outputs ri, ...,r„ 
such that Tj = 1 iS pi...pk represents i in binary. Note that a can be constructed with 0{n) 
gates by a simple recursion on k. 



Let (3{pi, ...,pk,g,r) be a (quantifier-free) prepositional formula of length 0{n) which holds 
iff the circuit a with input values pi, ...,pk takes on values ^for its internal gates and values 
r for its output gates. Then Si{p) is the formula 

V^f[(/3(pi, ...,Pk,g,r) A Z{p)) -^ [R{p,0) ^ ((ri A Xi) V ... V (r„ A a;„))]] 

where 

Z{p) = -^pk+i A ... A-.pcn 

It remains to discuss the formula /. This asserts that for all t < 2", Ct+i is the successor 
configuration to Ct (when Ct and Ct+i are represented by R.) Given a reasonable repre- 
sentation of the Turing machine configurations, it is straightforwward to construct such a 
nf (i?)-formula I of length 0{n). D 

Corollary: There is no proof system for the valid formulas of QPC(R)(or for the valid 
S^(i?) formulas) with the property that every valid formula A has a proof P such that 

\P\ = 2°(l^') (2) 

where |-P| is the bit length of P. 

Proof of the Corollary: We use the following 

Fact: There is a universal nondeterministic TM Mq such that for every nondeterministic 
TM M and all sufficiently large strings x which code M, 

Mq accepts x within 21^' steps <^=^ M accepts x within 2°'^l^l steps 

Let Co be a constant such that, referring to Lemma lA, 

\Fmo{x)\ < Co|x|, for all sufficiently long x 

Now suppose n is a proof system for unsatisfiability which violates the Corollary, so every 
unsatisfiable S^(i?) formula A has a proof P satisfying ([2]). Let Mi be a nondeterministic 
TM which on input x computes A = Fmo{x), guesses a proof P, and accepts iff P is a n 
proof of A (showing that A is unsatisfiable). Let S = 0.2/ cq and let Xi be a sufficiently long 
string coding Mi. Then 

Ml accepts xi within 2°'^l^'^l steps 

^^ there is a H proof P oi A = Fmo(xi) where \A\ < co\xi\ and \P\ < 2'^l^l < 2°-2|^'il 

<^=^ A is unsatisfiable 

<^==^ Mq does not accept xi within 2l^'il steps 

<^=^ Ml does not accept xi within 2'^l^il steps. 

This is a contradiction. D 

System G(R): 

This is the system G of quantified propositional calculus described in section 4.6 of Krajicek's 



book [5], extended so that formulas are allowed to be QPC(R) formulas, and we allow the 
axiom scheme AX above. In other words, G(R) is obtained from PK(R) by extending the 
definition of formula, and allowing the four quantifier rules of LK(Krajicek, page 58). 

Theorem 4: G(R) is sound and complete. 

Proof: Soundness is easy, since as before the axioms are valid and the rules preserve validity. 

We prove that every valid sequent has a G(R) proof by double induction, first on the 
maximum quantifier depth of formulas in the sequent, and second on the cost c{S) of the 
sequent, as defined in Definition 1 above. 

To see how to reduce the quantifier depth, consider the case 

r -^ A', 3xA{x) 
This can be derived by two applications of 3-right and one of contraction from 

r'^A',A(o),A(i) 

and this sequent is valid if the previous one is valid. D 

Remark: It seems that the obvious upper bound for the above proof length is doubly 
exponential, even in the case of nonrelativized G, and even for the case nonrelativized Gi. 

Consider the example 

^ 3xi.. 3XniAi A ... A Ara) 

If we apply the above method to get rid of the existential quantifiers, we obtain a sequent 
with 2" formulas, each of which is a conjunction of m formulas. Now to unwind all of these 
conjunctions in the usual way seems to generate 2™"^" sequents. 

Rackoff points out that this large upper bound is not surprising for the relativized case. In 
fact, if a simply exponential upper bound could be found, it would follow from Theorem 3 
that NEXP= coNEXP. 

However there is a simply exponential upper bound for the nonrelativized case. 

Theorem 5: (See Theorem VII. 3. 9 in [4].) Every valid sequent S of QPC (with no R) has 
a tree-like G proof with 0{2^^^) sequents (not counting weakenings and exchanges), where 
each sequent has length 0(15*1) and all cut formulas are atomic. 

Work to be done: 

• Carry out the translations of the relativized theories S2{R) and T2{R) into QPC(R). It 
may be easier to translate the two-sorted versions V^{R) and TV'^{R). (The theories V^ and 
TV^ are presented in [4], where prepositional translations are given.) 

• Once the translations have been written down, it should be possible to describe families of 
valid QPC(R) formulas corresponding to various search problems, and prove lower bounds 
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on their G(R) proof lengths by the same search problem separations used to separate various 
relativized theories of bounded arithmetic. 

Example: Let WPHP(i?, n) be a relativized propositional formula (in fact a ^1{R) for- 

2 

mula) representing the weak pigeonhole principle PHP" as follows. (Here we assume that 
p and g'are vectors of 2n variables, while r*and s are vectors of just n variables.) 

WPHP(i?, n) = 3p3g3f[{p^ g A R{p, r) A R{q, f)) V Vs^i?(p, s)] 

Conjecture 1: (WPHP(_R, n)) does not have polysize G2(R) proofs. 

Proposed Proof Outline: 

(i) Theorem 11.3.1, page 220 of Krajicek's book shows that the witnessing problem for 
WPHP (i?,n) is not in FP^^^^\ 

(ii) The witnessing problem for G2(R) proofs of S2(-R) formulas is in pp^^i^) , This is by 
analogy with the fact that the witnessing problem for G^(R) proofs of T![{R) formulas is in 
FP{R). 

(iii) If (WPHP(i?, n)) has polysize G2(R) proofs, then given n we could use an NP oracle 
to find a proof of WPHP(i?, n)), and then use (ii) to solve the witnessing problem with an 
NP{R) oracle. This contradicts (i). D 

In the same vein, we know (by translations into bounded depth Frege systems) that 

S2{R) \/PHP{R) 

(see Pitassi's thesis). This suggests 

Conjecture 2: (PHP(i?, n)) does not have polysize Gi(R)-proofs, for any i. 

Apparently we can translate theorems of 5*2 (-R) both into quasipolysize families of bounded 
depth Frege proofs, and into polysize families of G(R) proofs. This leads to 

Conjecture 3 (Pudlak): Find an RSUV style isomormphism between AC^-Frege and G(R). 

In a slightly different vein, we have 

Conjecture 4: (Morioka:) The ITER(R) Tautologies do not have polysize G^(R) proofs. 

Proposed Proof (Morioka): Prove a superpolynomial lower bound for the circuit size for 
solving ITER(R). 

• Think about using the oracle separations of NC and P in [2] to separate relativized G*(R) 
and Gi(R). 

• (Far out:) Try for lower bounds for unrelativized G. Of course there's no super proof 
system for QPC (including G) under the assumption NP ^ PSPACE. Can we get a lower 
bound for G proofs under the weaker assumption P ^ PSPACE? 
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